计算机与现代化 ›› 2012, Vol. 1 ›› Issue (1): 22-24,5.doi: 10.3969/j.issn.1006-2475.2012.01.006

• 人工智能 • 上一篇    下一篇

一种改进的K_means聚类方法

胡 伟   

  1. 山西财经大学实验教学中心,山西 太原 030006
  • 收稿日期:2011-07-18 修回日期:1900-01-01 出版日期:2012-01-10 发布日期:2012-01-10

An Improved K_means Clustering Algorithm

HU Wei   

  1. Experimental Teaching Center, Shanxi University of Finance and Economics, Taiyuan 030006, China
  • Received:2011-07-18 Revised:1900-01-01 Online:2012-01-10 Published:2012-01-10

摘要: 针对传统K_means聚类方法采用随机选择初始聚类中心而导致的收敛速度慢的问题,本文结合空间中的距离度量提出一种改进的K_means聚类算法。该方法通过给出有效的启发式信息,选择较好的聚类中心,减少聚类达到稳定状态所需要的迭代步骤,加速算法的执行。标准数据集上的实验结果表明,与传统的K_means聚类方法相比,本文提出的改进的聚类方法收敛速度快,从而在较少的迭代后得到良好的聚类效果。

关键词: K_means聚类, 初始聚类中心, 启发式信息, 收敛速度

Abstract: This paper presents an improved clustering model based on distance measurement, in order to solve the problem of slow convergence rate of traditional K_means clustering method by selecting initial cluster centers randomly. By using effective heuristic information, this method selects better clustering centers and reduces the iteration steps of attaining stable clustering state. Then the speed of algorithm is accelerated. Simulation results on UCI datasets demonstrate that comparing with traditional K_means clustering means, the improved K_means has fast convergence rate and the better clustering results are obtained by this model after less iterations.

Key words: K_means clustering, initial clustering center, heuristic information, convergence rate

中图分类号: